SEQUENCE LISTING 



<110> Punnonen, Juha
Apt, Doris
Neighbors, Margaret
Leong, Steven R.

<120> NOVEL TUMOR-ASSOCIATED ANTIGENS



<130> 0334.210US 

<140> US 10/828,559 
<141> 2004-04-19 

<150> US 60/464,780 
<151> 2003-04-22 

<160> 95 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 185 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-25 fragment comprising extra-cellular domain 
(ECD) 



<400> 1 



Arg Ile Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
1 5 10 15

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
20 25 30

Thr Ala Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
35 40 45

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
50 55 60

Ile Ile Glu Leu Lys His Lys Glu Arg Glu Ser Pro Tyr Asp Ser Lys
65 70 75 80

Ser Leu His Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
85 90 95

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
100 105 110

Ile Asp Leu Met Gln Asn Ser Ser Gln Lys Thr Gln Asp Asp Val Asp
115 120 125

Ile Ala Asp Val Ala Tyr Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
130 135 140

Leu Phe His Ser Lys Lys Met Asp Leu Arg Val Asn Gly Glu Leu Leu
145 150 155 160

Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Glu Lys Ala
165 170 175

Pro Glu Phe Ser Met Gln Gly Leu Lys
180 185



<210> 2 
<211> 57 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-25 fragment comprising propeptide (PP) 
<400> 2 

Gln Glu Glu Cys Val Cys Glu Asn Tyr Lys Leu Ala Val Asn Cys Phe
1 5 10 15

Val Asn Asn Asn Arg Glu Cys Gln Cys Thr Ser Val Gly Ala Gln Asn
20 25 30

Thr Val Ile Cys Ser Lys Leu Ala Ala Lys Cys Leu Val Met Lys Ala
35 40 45

Glu Met Asn Gly Ser Lys Leu Gly Arg
50 55



<210> 3 

<211> 23 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-25 fragment comprising signal peptide (SP) 



<400> 3 

Met Ala Pro Pro Gln Ala Leu Ala Leu Gly Leu Leu Leu Ala Ala Ala
1 5 10 15

Thr Ala Thr Phe Ala Ala Ala
20



<210> 4 
<211> 265 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-25 polypeptide (which comprises signal 
peptide+propeptide+ECD)

<400> 4 

Met Ala Pro Pro Gln Ala Leu Ala Leu Gly Leu Leu Leu Ala Ala Ala
1 5 10 15

Thr Ala Thr Phe Ala Ala Ala Gln Glu Glu Cys Val Cys Glu Asn Tyr
20 25 30

Lys Leu Ala Val Asn Cys Phe Val Asn Asn Asn Arg Glu Cys Gln Cys
35 40 45

Thr Ser Val Gly Ala Gln Asn Thr Val Ile Cys Ser Lys Leu Ala Ala
50 55 60

Lys Cys Leu Val Met Lys Ala Glu Met Asn Gly Ser Lys Leu Gly Arg
65 70 75 80

Arg Ile Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
85 90 95

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
100 105 110

Thr Ala Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
115 120 125

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
130 135 140

Ile Ile Glu Leu Lys His Lys Glu Arg Glu Ser Pro Tyr Asp Ser Lys
145 150 155 160

Ser Leu His Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
165 170 175

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
180 185 190

Ile Asp Leu Met Gln Asn Ser Ser Gln Lys Thr Gln Asp Asp Val Asp
195 200 205

Ile Ala Asp Val Ala Tyr Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
210 215 220

Leu Phe His Ser Lys Lys Met Asp Leu Arg Val Asn Gly Glu Leu Leu
225 230 235 240

Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Glu Lys Ala
245 250 255

Pro Glu Phe Ser Met Gln Gly Leu Lys
260 265



<210> 5 
<211> 242 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-25 fragment comprising propeptide+ECD



<400> 5 



Gln Glu Glu Cys Val Cys Glu Asn Tyr Lys Leu Ala Val Asn Cys Phe
1 5 10 15

Val Asn Asn Asn Arg Glu Cys Gln Cys Thr Ser Val Gly Ala Gln Asn
20 25 30

Thr Val Ile Cys Ser Lys Leu Ala Ala Lys Cys Leu Val Met Lys Ala
35 40 45

Glu Met Asn Gly Ser Lys Leu Gly Arg Arg Ile Lys Pro Glu Gly Ala
50 55 60

Leu Gln Asn Asn Asp Gly Leu Tyr Asp Pro Asp Cys Asp Glu Ser Gly
65 70 75 80

Leu Phe Lys Ala Lys Gln Cys Asn Gly Thr Ala Thr Cys Trp Cys Val
85 90 95

Asn Thr Ala Gly Val Arg Arg Thr Asp Lys Asp Thr Glu Ile Thr Cys
100 105 110

Ser Glu Arg Val Arg Thr Tyr Trp Ile Ile Ile Glu Leu Lys His Lys
115 120 125

Glu Arg Glu Ser Pro Tyr Asp Ser Lys Ser Leu His Thr Ala Leu Gln
130 135 140

Lys Glu Ile Thr Thr Arg Tyr Gln Leu Asp Pro Lys Phe Ile Thr Ser
145 150 155 160

Ile Leu Tyr Glu Asn Asn Val Ile Thr Ile Asp Leu Met Gln Asn Ser
165 170 175

Ser Gln Lys Thr Gln Asp Asp Val Asp Ile Ala Asp Val Ala Tyr Tyr
180 185 190

Phe Glu Lys Asp Val Lys Gly Glu Ser Leu Phe His Ser Lys Lys Met
195 200 205

Asp Leu Arg Val Asn Gly Glu Leu Leu Asp Leu Asp Pro Gly Gln Thr
210 215 220

Leu Ile Tyr Tyr Val Asp Glu Lys Ala Pro Glu Phe Ser Met Gln Gly
225 230 235 240

Leu Lys



<210> 6 
<211> 314 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-25 full-length/membrane-bound form, which
comprises N- to C-terminus signal
peptide+propeptide+ECD+TMD+CD



<400> 6 



Met Ala Pro Pro Gln Ala Leu Ala Leu Gly Leu Leu Leu Ala Ala Ala
1 5 10 15

Thr Ala Thr Phe Ala Ala Ala Gln Glu Glu Cys Val Cys Glu Asn Tyr
20 25 30

Lys Leu Ala Val Asn Cys Phe Val Asn Asn Asn Arg Glu Cys Gln Cys
35 40 45

Thr Ser Val Gly Ala Gln Asn Thr Val Ile Cys Ser Lys Leu Ala Ala
50 55 60

Lys Cys Leu Val Met Lys Ala Glu Met Asn Gly Ser Lys Leu Gly Arg
65 70 75 80

Arg Ile Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
85 90 95

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
100 105 110

Thr Ala Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
115 120 125

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
130 135 140

Ile Ile Glu Leu Lys His Lys Glu Arg Glu Ser Pro Tyr Asp Ser Lys
145 150 155 160

Ser Leu His Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
165 170 175

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
180 185 190

Ile Asp Leu Met Gln Asn Ser Ser Gln Lys Thr Gln Asp Asp Val Asp
195 200 205

Ile Ala Asp Val Ala Tyr Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
210 215 220

Leu Phe His Ser Lys Lys Met Asp Leu Arg Val Asn Gly Glu Leu Leu
225 230 235 240

Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Glu Lys Ala
245 250 255

Pro Glu Phe Ser Met Gln Gly Leu Lys Ala Gly Val Ile Ala Val Ile
260 265 270

Val Val Val Val Met Ala Val Val Ala Gly Ile Val Val Leu Val Ile
275 280 285

Ser Arg Lys Lys Arg Met Ala Lys Tyr Glu Lys Ala Glu Ile Lys Glu
290 295 300

Met Gly Glu Met His Arg Glu Leu Asn Ala
305 310



<210> 7 

<211> 234 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mature TAg-25 polypeptide, which comprises N- to
C-terminus ECD+TMD+CD

<220>
<221> DOMAIN
<222> (186)...(208)
<223> Transmembrane domain (TMD)

<400> 7 



Arg Ile Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
1 5 10 15

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
20 25 30

Thr Ala Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
35 40 45

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
50 55 60

Ile Ile Glu Leu Lys His Lys Glu Arg Glu Ser Pro Tyr Asp Ser Lys
65 70 75 80

Ser Leu His Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
85 90 95

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
100 105 110

Ile Asp Leu Met Gln Asn Ser Ser Gln Lys Thr Gln Asp Asp Val Asp
115 120 125

Ile Ala Asp Val Ala Tyr Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
130 135 140

Leu Phe His Ser Lys Lys Met Asp Leu Arg Val Asn Gly Glu Leu Leu
145 150 155 160

Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Glu Lys Ala
165 170 175

Pro Glu Phe Ser Met Gln Gly Leu Lys Ala Gly Val Ile Ala Val Ile
180 185 190

Val Val Val Val Met Ala Val Val Ala Gly Ile Val Val Leu Val Ile
195 200 205

Ser Arg Lys Lys Arg Met Ala Lys Tyr Glu Lys Ala Glu Ile Lys Glu
210 215 220

Met Gly Glu Met His Arg Glu Leu Asn Ala
225 230



<210> 8
<211> 208
<212> PRT
<213> Artificial Sequence

<220>
<223> TAg-25 fragment comprising ECD+TMD

<400> 8

Arg Ile Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
1 5 10 15

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
20 25 30

Thr Ala Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
35 40 45

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
50 55 60

Ile Ile Glu Leu Lys His Lys Glu Arg Glu Ser Pro Tyr Asp Ser Lys
65 70 75 80

Ser Leu His Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
85 90 95

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
100 105 110

Ile Asp Leu Met Gln Asn Ser Ser Gln Lys Thr Gln Asp Asp Val Asp
115 120 125

Ile Ala Asp Val Ala Tyr Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
130 135 140

Leu Phe His Ser Lys Lys Met Asp Leu Arg Val Asn Gly Glu Leu Leu
145 150 155 160

Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Glu Lys Ala
165 170 175

Pro Glu Phe Ser Met Gln Gly Leu Lys Ala Gly Val Ile Ala Val Ile
180 185 190

Val Val Val Val Met Ala Val Val Ala Gly Ile Val Val Leu Val Ile
195 200 205








<210> 9
<211> 63
<212> PRT
<213> Artificial Sequence

<220>
<223> TAg-18 fragment comprising ECD

<400> 9

Gln Asn Asp Val Asp Ile Ala Asp Val Ala His Tyr Phe Glu Lys Asp
1 5 10 15

Val Lys Gly Glu Ser Leu Phe His Ser Ser Lys Lys Met Asp Leu Arg
20 25 30

Val Asn Gly Glu Gln Leu Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr
35 40 45

Tyr Val Asp Arg Asn Ala Pro Glu Phe Ser Met Gln Ala Leu Lys
50 55 60











<210> 10
<211> 115
<212> PRT
<213> Artificial Sequence

<220>
<223> TAg-18 ECD+TMD+CD

<400> 10

Gln Asn Asp Val Asp Ile Ala Asp Val Ala His Tyr Phe Glu Lys Asp
1 5 10 15

Val Lys Gly Glu Ser Leu Phe His Ser Ser Lys Lys Met Asp Leu Arg
20 25 30

Val Asn Gly Glu Gln Leu Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr
35 40 45

Tyr Val Asp Arg Asn Ala Pro Glu Phe Ser Met Gln Ala Leu Lys Ala
50 55 60

Gly Val Cys Ala Val Ile Val Val Val Met Ile Ala Val Val Ala Gly
65 70 75 80

Ile Val Val Leu Val Ile Ser Arg Lys Lys Arg Met Ala Lys Tyr Glu
85 90 95

Lys Ala Glu Ile Lys Glu Met Gly Arg Met His Arg Glu Leu Asn Ala
100 105 110

Ser Val Leu
115






















<210> 11
<211> 29
<212> PRT
<213> Artificial Sequence

<220>
<223> TAg-18 CD-like sequence

<400> 11

Ser Arg Lys Lys Arg Met Ala Lys Tyr Glu Lys Ala Glu Ile Lys Glu
1 5 10 15

Met Gly Arg Met His Arg Glu Leu Asn Ala Ser Val Leu
20 25














<210> 12
<211> 186
<212> PRT
<213> Artificial Sequence

<220>
<223> TAg-21 fragment comprising ECD

<400> 12

Arg Ile Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
1 5 10 15

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
20 25 30

Thr Ala Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
35 40 45

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
50 55 60

Ile Ile Glu Leu Lys His Lys Glu Arg Glu Ser Pro Tyr Asp Ser Lys
65 70 75 80

Ser Leu His Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
85 90 95

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
100 105 110

Ile Asp Leu Met Gln Asn Ser Ser Gln Lys Thr Gln Asp Asp Val Asp
115 120 125

Ile Ala Asp Val Ala Tyr Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
130 135 140

Leu Phe His Ser Ser Lys Lys Met Asp Leu Arg Val Asn Gly Glu Leu
145 150 155 160

Leu Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Glu Lys
165 170 175

Ala Pro Glu Phe Ser Met Gln Gly Leu Lys
180 185



<210> 13 
<211> 266 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-21 polypeptide (comprising SP+PP+ECD) 



<400> 13 



Met Ala Pro Pro Gln Ala Leu Ala Phe Gly Leu Leu Leu Ala Ala Ala
1 5 10 15

Thr Ala Thr Phe Ala Ala Ala Gln Glu Glu Cys Val Cys Glu Asn Tyr
20 25 30

Lys Leu Ala Val Asn Cys Phe Val Asn Asn Asn Arg Glu Cys Gln Cys
35 40 45

Thr Ser Val Gly Ala Gln Asn Thr Val Ile Cys Ser Lys Leu Ala Ala
50 55 60

Lys Cys Leu Val Met Lys Ala Glu Met Asn Gly Ser Lys Leu Gly Arg
65 70 75 80

Arg Ile Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
85 90 95

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
100 105 110

Thr Ala Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
115 120 125

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
130 135 140

Ile Ile Glu Leu Lys His Lys Glu Arg Glu Ser Pro Tyr Asp Ser Lys
145 150 155 160

Ser Leu Arg Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
165 170 175

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
180 185 190

Ile Asp Leu Met Gln Asn Ser Ser Gln Lys Thr Gln Asn Asp Val Asp
195 200 205

Ile Ala Asp Val Ala His Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
210 215 220

Leu Phe His Ser Ser Lys Lys Met Asp Leu Arg Val Asn Gly Glu Gln
225 230 235 240

Leu Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Arg Asn
245 250 255

Ala Pro Glu Phe Ser Met Gln Ala Leu Lys
260 265



<210> 14 

<211> 289 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-21 extended polypeptide comprising 
SP+PP+ECD+TMD 



<400> 14 



Met Ala Pro Pro Gln Ala Leu Ala Phe Gly Leu Leu Leu Ala Ala Ala
1 5 10 15

Thr Ala Thr Phe Ala Ala Ala Gln Glu Glu Cys Val Cys Glu Asn Tyr
20 25 30

Lys Leu Ala Val Asn Cys Phe Val Asn Asn Asn Arg Glu Cys Gln Cys
35 40 45

Thr Ser Val Gly Ala Gln Asn Thr Val Ile Cys Ser Lys Leu Ala Ala
50 55 60

Lys Cys Leu Val Met Lys Ala Glu Met Asn Gly Ser Lys Leu Gly Arg
65 70 75 80

Arg Ile Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
85 90 95

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
100 105 110

Thr Ala Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
115 120 125

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
130 135 140

Ile Ile Glu Leu Lys His Lys Glu Arg Glu Ser Pro Tyr Asp Ser Lys
145 150 155 160

Ser Leu Arg Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
165 170 175

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
180 185 190

Ile Asp Leu Met Gln Asn Ser Ser Gln Lys Thr Gln Asn Asp Val Asp
195 200 205

Ile Ala Asp Val Ala His Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
210 215 220

Leu Phe His Ser Ser Lys Lys Met Asp Leu Arg Val Asn Gly Glu Gln
225 230 235 240

Leu Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Arg Asn
245 250 255

Ala Pro Glu Phe Ser Met Gln Ala Leu Lys Ala Gly Ile Ile Ala Val
260 265 270

Ile Val Val Val Met Ile Ala Val Val Ala Gly Ile Val Val Leu Val
275 280 285

Ile



<210> 15 
<211> 23 
<212> PRT 
<213> Artificial Sequence
<220>

<223> TAg-21 TMD
<400> 15

Ala Gly Ile Ile Ala Val Ile Val Val Val Met Ile Ala Val Val Ala
1 5 10 15

Gly Ile Val Val Leu Val Ile
20



<210> 16 
<211> 555 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding TAg-25 fragment comprising ECD domain 
<400> 16 

aggatcaaac ctgaaggagc tctgcagaac aacgatggtc tctacgaccc cgactgtgac 60 
gagtccggcc tcttcaaggc caaacagtgt aatggcactg ctacatgctg gtgcgtgaac 120
accgctgggg tgcgccggac cgataaggat accgaaatta cctgttctga gagggtccgg 180 
acatattgga tcatcattga actcaaacat aaagagcgcg agtctccata cgattctaaa 240 
tccctccata ctgcactgca aaaggaaatc actacacgct accagctgga tccaaaattc 300
attacatcca tcctctatga gaacaatgtt attacaattg atctgatgca aaatagctct 360 
cagaagactc aagacgacgt ggacatcgct gatgtggcct actattttga gaaggacgtt 420
aagggggaat cactgttcca ttcaaagaaa atggatctga gggttaatgg cgagctgctg 480 
gacctggacc cagggcaaac cctgatctat tatgtggacg agaaggctcc agaattctct 540
atgcaaggcc tgaag 555 

<210> 17 
<211> 171 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding TAg-25 fragment comprising propeptide
<400> 17 

caggaggagt gtgtgtgcga aaactacaag ctcgctgtca actgtttcgt caacaataat 60 
agagaatgcc agtgcacttc tgtgggagca cagaatacag tgatctgtag caaactggct 120 
gcaaagtgtc tggtcatgaa ggccgaaatg aacggatcca agctcgggcg g 171 

<210> 18 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding TAg-25 fragment comprising SP 
<400> 18 

atggcacccc ctcaagcact ggcactgggt ctgctgctgg ccgccgctac cgccactttc 60 
gccgcagca 69 
<210> 19
<211> 795 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding TAg-25 polypeptide (which comprises
signal peptide+propeptide+ECD)

<400> 19 

atggcacccc ctcaagcact ggcactgggt ctgctgctgg ccgccgctac cgccactttc 60 
gccgcagcac aggaggagtg tgtgtgcgaa aactacaagc tcgctgtcaa ctgtttcgtc 120 
aacaataata gagaatgcca gtgcacttct gtgggagcac agaatacagt gatctgtagc 180 
aaactggctg caaagtgtct ggtcatgaag gccgaaatga acggatccaa gctcgggcgg 240 
aggatcaaac ctgaaggagc tctgcagaac aacgatggtc tctacgaccc cgactgtgac 300 
gagtccggcc tcttcaaggc caaacagtgt aatggcactg ctacatgctg gtgcgtgaac 360 
accgctgggg tgcgccggac cgataaggat accgaaatta cctgttctga gagggtccgg 420 
acatattgga tcatcattga actcaaacat aaagagcgcg agtctccata cgattctaaa 480 
tccctccata ctgcactgca aaaggaaatc actacacgct accagctgga tccaaaattc 540
attacatcca tcctctatga gaacaatgtt attacaattg atctgatgca aaatagctct 600 
cagaagactc aagacgacgt ggacatcgct gatgtggcct actattttga gaaggacgtt 660 
aagggggaat cactgttcca ttcaaagaaa atggatctga gggttaatgg cgagctgctg 720
gacctggacc cagggcaaac cctgatctat tatgtggacg agaaggctcc agaattctct 780 
atgcaaggcc tgaag 795 

<210> 20 
<211> 726 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding TAg-25 fragment comprising PP+ECD 
<400> 20 

caggaggagt gtgtgtgcga aaactacaag ctcgctgtca actgtttcgt caacaataat 60 
agagaatgcc agtgcacttc tgtgggagca cagaatacag tgatctgtag caaactggct 120 
gcaaagtgtc tggtcatgaa ggccgaaatg aacggatcca agctcgggcg gaggatcaaa 180 
cctgaaggag ctctgcagaa caacgatggt ctctacgacc ccgactgtga cgagtccggc 240 
ctcttcaagg ccaaacagtg taatggcact gctacatgct ggtgcgtgaa caccgctggg 300 
gtgcgccgga ccgataagga taccgaaatt acctgttctg agagggtccg gacatattgg 360
atcatcattg aactcaaaca taaagagcgc gagtctccat acgattctaa atccctccat 420 
actgcactgc aaaaggaaat cactacacgc taccagctgg atccaaaatt cattacatcc 480 
atcctctatg agaacaatgt tattacaatt gatctgatgc aaaatagctc tcagaagact 540 
caagacgacg tggacatcgc tgatgtggcc tactattttg agaaggacgt taagggggaa 600 
tcactgttcc attcaaagaa aatggatctg agggttaatg gcgagctgct ggacctggac 660 
ccagggcaaa ccctgatcta ttatgtggac gagaaggctc cagaattctc tatgcaaggc 720 
ctgaag 726 

<210> 21 
<211> 942 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding TAg-25 full-length/membrane-bound
form which comprises SP+PP+ECD+TMD+CD
<220>

<221> misc_feature
<222> (796)...(864)

<223> transmembrane domain (TMD) coding sequence
<220>

<221> misc_feature
<222> (865)...(942)

<223> cytoplasmic domain (CD) coding sequence 
<400> 21 

atggcacccc ctcaagcact ggcactgggt ctgctgctgg ccgccgctac cgccactttc 60
gccgcagcac aggaggagtg tgtgtgcgaa aactacaagc tcgctgtcaa ctgtttcgtc 120 
aacaataata gagaatgcca gtgcacttct gtgggagcac agaatacagt gatctgtagc 180 
aaactggctg caaagtgtct ggtcatgaag gccgaaatga acggatccaa gctcgggcgg 240 
aggatcaaac ctgaaggagc tctgcagaac aacgatggtc tctacgaccc cgactgtgac 300
gagtccggcc tcttcaaggc caaacagtgt aatggcactg ctacatgctg gtgcgtgaac 360 
accgctgggg tgcgccggac cgataaggat accgaaatta cctgttctga gagggtccgg 420 
acatattgga tcatcattga actcaaacat aaagagcgcg agtctccata cgattctaaa 480 
tccctccata ctgcactgca aaaggaaatc actacacgct accagctgga tccaaaattc 540 
attacatcca tcctctatga gaacaatgtt attacaattg atctgatgca aaatagctct 600 
cagaagactc aagacgacgt ggacatcgct gatgtggcct actattttga gaaggacgtt 660 
aagggggaat cactgttcca ttcaaagaaa atggatctga gggttaatgg cgagctgctg 720 
gacctggacc cagggcaaac cctgatctat tatgtggacg agaaggctcc agaattctct 780 
atgcaaggcc tgaaggctgg tgttattgct gttattgtgg ttgtggtgat ggcagttgtt 840 
gctggaattg ttgtgctggt tatttccaga aagaagagaa tggcaaagta tgagaaggct 900 
gagataaagg agatgggtga gatgcatagg gaactcaatg ca 942 

<210> 22 
<211> 702 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding TAg-25 Mature domain (which comprises 
ECD+TMD+CD of a full-length or membrane bound form 
of TAg-25).



<220> 

<221> misc_feature 
<222> (556)...(624)

<223> transmembrane domain (TMD) coding sequence 
<400> 22 

aggatcaaac ctgaaggagc tctgcagaac aacgatggtc tctacgaccc cgactgtgac 60 
gagtccggcc tcttcaaggc caaacagtgt aatggcactg ctacatgctg gtgcgtgaac 120
accgctgggg tgcgccggac cgataaggat accgaaatta cctgttctga gagggtccgg 180 
acatattgga tcatcattga actcaaacat aaagagcgcg agtctccata cgattctaaa 240 
tccctccata ctgcactgca aaaggaaatc actacacgct accagctgga tccaaaattc 300 
attacatcca tcctctatga gaacaatgtt attacaattg atctgatgca aaatagctct 360 
cagaagactc aagacgacgt ggacatcgct gatgtggcct actattttga gaaggacgtt 420 
aagggggaat cactgttcca ttcaaagaaa atggatctga gggttaatgg cgagctgctg 480 
gacctggacc cagggcaaac cctgatctat tatgtggacg agaaggctcc agaattctct 540 
atgcaaggcc tgaaggctgg tgttattgct gttattgtgg ttgtggtgat ggcagttgtt 600 
gctggaattg ttgtgctggt tatttccaga aagaagagaa tggcaaagta tgagaaggct 660 
gagataaagg agatgggtga gatgcatagg gaactcaatg ca 702 
<210> 23

<211> 624 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding TAg-25 fragment comprising ECD+TMD 
<220> 

<221> misc_feature

<222> (556)...(624)

<223> transmembrane domain (TMD) coding sequence 



<400> 23 

aggatcaaac ctgaaggagc tctgcagaac aacgatggtc tctacgaccc cgactgtgac 60 
gagtccggcc tcttcaaggc caaacagtgt aatggcactg ctacatgctg gtgcgtgaac 120 
accgctgggg tgcgccggac cgataaggat accgaaatta cctgttctga gagggtccgg 180
acatattgga tcatcattga actcaaacat aaagagcgcg agtctccata cgattctaaa 240 
tccctccata ctgcactgca aaaggaaatc actacacgct accagctgga tccaaaattc 300 
attacatcca tcctctatga gaacaatgtt attacaattg atctgatgca aaatagctct 360 
cagaagactc aagacgacgt ggacatcgct gatgtggcct actattttga gaaggacgtt 420 
aagggggaat cactgttcca ttcaaagaaa atggatctga gggttaatgg cgagctgctg 480 
gacctggacc cagggcaaac cctgatctat tatgtggacg agaaggctcc agaattctct 540 
atgcaaggcc tgaaggctgg tgttattgct gttattgtgg ttgtggtgat ggcagttgtt 600 
gctggaattg ttgtgctggt tatt 624 

<210> 24 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA sequence encoding TAg-21 fragment comprising
SP

<400> 24 

atggcacctc cccaggcact ggcatttgga ctgctgctgg ctgcagcaac cgccacattc 60 
gctgctgcc 69 



<210> 25 

<211> 171 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> DNA encoding TAg-21 fragment comprising propeptide 



<400> 25 

caggaggagt gtgtgtgtga gaactataaa ctggctgtca attgttttgt taataacaat 60
agggagtgcc aatgtactag cgtgggagcc caaaacactg tcatttgctc caaactcgcc 120
gccaaatgtc tcgtcatgaa agctgaaatg aatggtagca aactgggacg g 171 

<210> 26 
<211> 558 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> DNA encoding TAg-21 fragment comprising ECD



<400> 26 

aggattaagc ccgaaggggc cctccagaac aatgacggac tctacgatcc agactgcgac 60 
gagagcgggc tgttcaaggc taagcagtgc aatggcaccg ccacctgttg gtgtgtgaat 120 
accgctggag tgcggcggac agacaaagac actgagatca cctgtagcga gagggtgcgc 180 
acttattgga tcatcattga actgaaacac aaggaacgcg aatccccata tgattccaag 240 
agcctgagga ccgccctcca gaaagagatc actactagat atcagctgga ccccaaattc 300 
atcaccagca ttctgtacga gaacaatgtc attacaatcg atctgatgca aaacagcagc 360 
cagaagaccc agaatgacgt ggacatcgcc gatgtggccc attattttga gaaagatgtc 420
aagggggaat cactgttcca cagctccaag aagatggacc tgagagtgaa cggtgaacaa 480 
ctcgacctcg atcctgggca gacactgatc tactatgtcg acaggaatgc ccctgaattc 540 
agcatgcagg ccctgaag 558 

<210> 27 
<211> 867 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding TAg-21 extended polypeptide 
comprising SP+PP+ECD+TMD 

<220> 

<221> misc_feature 
<222> (70)...(240)

<223> propeptide (PP) coding sequence 
<220> 

<221> misc_feature 
<222> (799)...(867)

<223> transmembrane domain (TMD) coding sequence 
<400> 27 

atggcacctc cccaggcact ggcatttgga ctgctgctgg ctgcagcaac cgccacattc 60 
gctgctgccc aggaggagtg tgtgtgtgag aactataaac tggctgtcaa ttgttttgtt 120
aataacaata gggagtgcca atgtactagc gtgggagccc aaaacactgt catttgctcc 180 
aaactcgccg ccaaatgtct cgtcatgaaa gctgaaatga atggtagcaa actgggacgg 240 
aggattaagc ccgaaggggc cctccagaac aatgacggac tctacgatcc agactgcgac 300 
gagagcgggc tgttcaaggc taagcagtgc aatggcaccg ccacctgttg gtgtgtgaat 360 
accgctggag tgcggcggac agacaaagac actgagatca cctgtagcga gagggtgcgc 420 
acttattgga tcatcattga actgaaacac aaggaacgcg aatccccata tgattccaag 480 
agcctgagga ccgccctcca gaaagagatc actactagat atcagctgga ccccaaattc 540 
atcaccagca ttctgtacga gaacaatgtc attacaatcg atctgatgca aaacagcagc 600 
cagaagaccc agaatgacgt ggacatcgcc gatgtggccc attattttga gaaagatgtc 660 
aagggggaat cactgttcca cagctccaag aagatggacc tgagagtgaa cggtgaacaa 720 
ctcgacctcg atcctgggca gacactgatc tactatgtcg acaggaatgc ccctgaattc 780 
agcatgcagg ccctgaaggc cggtatcatc gccgtgatcg tggttgttat gatcgccgtt 840 
gtggccggca tcgtcgtgct ggtgatc 867 

<210> 28 
<211> 345 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> DNA sequence encoding TAg-18 alternative fragment
comprising ECD+TMD+CD 

<400> 28 

cagaatgatg tggacatagc tgatgtggct cattattttg aaaaagatgt taaaggtgaa 60 
tccttgtttc attcttctaa gaaaatggac ctgagagtaa atggagaaca actggatctg 120
gatcctggtc aaactttaat ttattatgtt gatagaaatg cacctgaatt ttcaatgcag 180 
gctctaaaag ctggtgtttg tgctgttatt gtggttgtga tgatagcagt tgttgctgga 240 
attgttgtgc tggttatttc cagaaagaag agaatggcaa agtatgagaa ggctgagata 300 
aaggagatgg gtaggatgca tagggaactc aatgcatcag tccta 345 

<210> 29 
<211> 552 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA sequence encoding human EpCAM fragment 
comprising hEpCAM ECD 

<400> 29 

gcaaaacctg aaggggccct ccagaacaat gatgggcttt atgatcctga ctgcgatgag 60 
agcgggctct ttaaggccaa gcagtgcaac ggcacctcca cgtgctggtg tgtgaacact 120
gctggggtca gaagaacaga caaggacact gaaataacct gctctgagcg agtgagaacc 180 
tactggatca tcattgaact aaaacacaaa gcaagagaaa aaccttatga tagtaaaagt 240 
ttgcggactg cacttcagaa ggagatcaca acgcgttatc aactggatcc aaaatttatc 300 
acgagtattt tgtatgagaa taatgttatc actattgatc tggttcaaaa ttcttctcaa 360 
aaaactcaga atgatgtgga catagctgat gtggcttatt attttgaaaa agatgttaaa 420
ggtgaatcct tgtttcattc taagaaaatg gacctgacag taaatgggga acaactggat 480 
ctggatcctg gtcaaacttt aatttattat gttgatgaaa aagcacctga attctcaatg 540 
cagggtctaa aa 552 

<210> 30 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding hEpCAM fragment comprising signal 
peptide 

<400> 30 

atggcgcccc cgcaggtcct cgcgttcggg cttctgcttg ccgcggcgac ggcgactttt 60 
gcc 63 

<210> 31 
<211> 180 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding hEpCAM fragment comprising propeptide 
<400> 31 

gcagctcagg aagaatgtgt ctgtgaaaac tacaagctgg ccgtaaactg ctttgtgaat 60 
aataatcgtc aatgccagtg tacttcagtt ggtgcacaaa atactgtcat ttgctcaaag 120 
ctggctgcca aatgtttggt gatgaaggca gaaatgaatg gctcaaaact tgggagaaga 180



<210> 32 
<211> 266 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-18 chimeric polypeptide (comprising SP+PP+ECD) 
<400> 32 



Met Ala Pro Pro Gln Val Leu Ala Phe Gly Leu Leu Leu Ala Ala Ala
1               5                   10                  15

Thr Ala Thr Phe Ala Ala Ala Gln Glu Glu Cys Val Cys Glu Asn Tyr
            20                  25                  30

Lys Leu Ala Val Asn Cys Phe Val Asn Asn Asn Arg Gln Cys Gln Cys
        35                  40                  45

Thr Ser Val Gly Ala Gln Asn Thr Val Ile Cys Ser Lys Leu Ala Ala
    50                  55                  60

Lys Cys Leu Val Met Lys Ala Glu Met Asn Gly Ser Lys Leu Gly Arg
65                  70                  75                  80

Arg Ala Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
                85                  90                  95

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
            100                 105                 110

Thr Ser Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
        115                 120                 125

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
    130                 135                 140

Ile Ile Glu Leu Lys His Lys Ala Arg Glu Lys Pro Tyr Asp Ser Lys
145                 150                 155                 160

Ser Leu Arg Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
                165                 170                 175

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
            180                 185                 190

Ile Asp Leu Val Gln Asn Ser Ser Gln Lys Thr Gln Asn Asp Val Asp
        195                 200                 205

Ile Ala Asp Val Ala His Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
    210                 215                 220

Leu Phe His Ser Ser Lys Lys Met Asp Leu Arg Val Asn Gly Glu Gln
225                 230                 235                 240

Leu Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Arg Asn
                245                 250                 255

Ala Pro Glu Phe Ser Met Gln Ala Leu Lys
            260                 265















<210> 33 

<211> 798 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding TAg-18 chimeric polypeptide 
comprising SP+PP+ECD 
<400> 33

atggcgcccc cgcaggtcct cgcgttcggg cttctgcttg ccgcggcgac ggcgactttt 60 
gccgcagctc aggaagaatg tgtctgtgaa aactacaagc tggccgtaaa ctgctttgtg 120
aataataatc gtcaatgcca gtgtacttca gttggtgcac aaaatactgt catttgctca 180 
aagctggctg ccaaatgttt ggtgatgaag gcagaaatga atggctcaaa acttgggaga 240 
agagcaaaac ctgaaggggc cctccagaac aatgatgggc tttatgatcc tgactgcgat 300 
gagagcgggc tctttaaggc caagcagtgc aacggcacct ccacgtgctg gtgtgtgaac 360 
actgctgggg tcagaagaac agacaaggac actgaaataa cctgctctga gcgagtgaga 420 
acctactgga tcatcattga actaaaacac aaagcaagag aaaaacctta tgatagtaaa 480 
agtttgcgga ctgcacttca gaaggagatc acaacgcgtt atcaactgga tccaaaattt 540 
atcacgagta ttttgtatga gaataatgtt atcactattg atctggttca aaattcttct 600 
caaaaaactc agaatgatgt ggacatagct gatgtggctc attattttga aaaagatgtt 660 
aaaggtgaat ccttgtttca ttcttctaag aaaatggacc tgagagtaaa tggagaacaa 720
ctggatctgg atcctggtca aactttaatt tattatgttg atagaaatgc acctgaattt 780 
tcaatgcagg ctctaaaa 798 

<210> 34 
<211> 318 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-18 chimera extended polypeptide which 
comprises SP+PP+ECD+TMD+CD 

<400> 34 

Met Ala Pro Pro Gln Val Leu Ala Phe Gly Leu Leu Leu Ala Ala Ala
1               5                   10                  15

Thr Ala Thr Phe Ala Ala Ala Gln Glu Glu Cys Val Cys Glu Asn Tyr
            20                  25                  30

Lys Leu Ala Val Asn Cys Phe Val Asn Asn Asn Arg Gln Cys Gln Cys
        35                  40                  45

Thr Ser Val Gly Ala Gln Asn Thr Val Ile Cys Ser Lys Leu Ala Ala
    50                  55                  60

Lys Cys Leu Val Met Lys Ala Glu Met Asn Gly Ser Lys Leu Gly Arg
65                  70                  75                  80

Arg Ala Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
                85                  90                  95

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
            100                 105                 110

Thr Ser Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
        115                 120                 125

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
    130                 135                 140

Ile Ile Glu Leu Lys His Lys Ala Arg Glu Lys Pro Tyr Asp Ser Lys
145                 150                 155                 160

Ser Leu Arg Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
                165                 170                 175

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
            180                 185                 190

Ile Asp Leu Val Gln Asn Ser Ser Gln Lys Thr Gln Asn Asp Val Asp
        195                 200                 205

Ile Ala Asp Val Ala His Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
    210                 215                 220

Leu Phe His Ser Ser Lys Lys Met Asp Leu Arg Val Asn Gly Glu Gln
225                 230                 235                 240

Leu Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Arg Asn
                245                 250                 255

Ala Pro Glu Phe Ser Met Gln Ala Leu Lys Ala Gly Val Cys Ala Val
            260                 265                 270

Ile Val Val Val Met Ile Ala Val Val Ala Gly Ile Val Val Leu Val
        275                 280                 285

Ile Ser Arg Lys Lys Arg Met Ala Lys Tyr Glu Lys Ala Glu Ile Lys
    290                 295                 300

Glu Met Gly Glu Met His Arg Glu Leu Asn Ala Ser Val Leu
305                 310                 315



<210> 35 

<211> 957 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA sequence encoding TAg-18 chimera (comprising 
SP+PP+ECD+TMD+CD) 



<400> 35 

atggcgcccc cgcaggtcct cgcgttcggg cttctgcttg ccgcggcgac ggcgactttt 60 
gccgcagctc aggaagaatg tgtctgtgaa aactacaagc tggccgtaaa ctgctttgtg 120 
aataataatc gtcaatgcca gtgtacttca gttggtgcac aaaatactgt catttgctca 180 
aagctggctg ccaaatgttt ggtgatgaag gcagaaatga atggctcaaa acttgggaga 240 
agagcaaaac ctgaaggggc cctccagaac aatgatgggc tttatgatcc tgactgcgat 300
gagagcgggc tctttaaggc caagcagtgc aacggcacct ccacgtgctg gtgtgtgaac 360 
actgctgggg tcagaagaac agacaaggac actgaaataa cctgctctga gcgagtgaga 420 
acctactgga tcatcattga actaaaacac aaagcaagag aaaaacctta tgatagtaaa 480 
agtttgcgga ctgcacttca gaaggagatc acaacgcgtt atcaactgga tccaaaattt 540 
atcacgagta ttttgtatga gaataatgtt atcactattg atctggttca aaattcttct 600 
caaaaaactc agaatgatgt ggacatagct gatgtggctc attattttga aaaagatgtt 660 
aaaggtgaat ccttgtttca ttcttctaag aaaatggacc tgagagtaaa tggagaacaa 720 
ctggatctgg atcctggtca aactttaatt tattatgttg atagaaatgc acctgaattt 780 
tcaatgcagg ctctaaaagc tggtgtttgt gctgttattg tggttgtgat gatagcagtt 840 
gttgctggaa ttgttgtgct ggttatttcc agaaagaaga gaatggcaaa gtatgagaag 900 
gctgagataa aggagatggg taggatgcat agggaactca atgcatcagt cctataa 957 



<210> 36 

<211> 185 

<212> PRT 

<213> Homo sapiens 



<220> 

<223> Fragment of human EpCAM (hEpCAM) comprising hEpCAM
ECD



<400> 36 

Arg Ala Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
1               5                   10                  15

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
            20                  25                  30

Thr Ser Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
        35                  40                  45

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
    50                  55                  60

Ile Ile Glu Leu Lys His Lys Ala Arg Glu Lys Pro Tyr Asp Ser Lys
65                  70                  75                  80

Ser Leu Arg Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
                85                  90                  95

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
            100                 105                 110

Ile Asp Leu Val Gln Asn Ser Ser Gln Lys Thr Gln Asn Asp Val Asp
        115                 120                 125

Ile Ala Asp Val Ala Tyr Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
    130                 135                 140

Leu Phe His Ser Lys Lys Met Asp Leu Thr Val Asn Gly Glu Gln Leu
145                 150                 155                 160

Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Glu Lys Ala
                165                 170                 175

Pro Glu Phe Ser Met Gln Gly Leu Lys
            180                 185



<210> 37 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Fragment of hEpCAM comprising hEpCAM signal 
peptide 

<400> 37 

Met Ala Pro Pro Gln Val Leu Ala Phe Gly Leu Leu Leu Ala Ala Ala
1               5                   10                  15

Thr Ala Thr Phe Ala Ala Ala
            20



<210> 38 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Fragment of hEpCAM comprising hEpCAM propeptide 
<400> 38 

Gln Glu Glu Cys Val Cys Glu Asn Tyr Lys Leu Ala Val Asn Cys Phe
1               5                   10                  15

Val Asn Asn Asn Arg Gln Cys Gln Cys Thr Ser Val Gly Ala Gln Asn
            20                  25                  30

Thr Val Ile Cys Ser Lys Leu Ala Ala Lys Cys Leu Val Met Lys Ala
        35                  40                  45

Glu Met Asn Gly Ser Lys Leu Gly Arg
    50                  55



<210> 39 

<211> 242 

<212> PRT 

<213> Homo sapiens 
<220>

<223> Fragment of hEpCAM comprising hEpCAM propeptide+ 
ECD 



<400> 39 



Gln Glu Glu Cys Val Cys Glu Asn Tyr Lys Leu Ala Val Asn Cys Phe
1               5                   10                  15

Val Asn Asn Asn Arg Gln Cys Gln Cys Thr Ser Val Gly Ala Gln Asn
            20                  25                  30

Thr Val Ile Cys Ser Lys Leu Ala Ala Lys Cys Leu Val Met Lys Ala
        35                  40                  45

Glu Met Asn Gly Ser Lys Leu Gly Arg Arg Ala Lys Pro Glu Gly Ala
    50                  55                  60

Leu Gln Asn Asn Asp Gly Leu Tyr Asp Pro Asp Cys Asp Glu Ser Gly
65                  70                  75                  80

Leu Phe Lys Ala Lys Gln Cys Asn Gly Thr Ser Thr Cys Trp Cys Val
                85                  90                  95

Asn Thr Ala Gly Val Arg Arg Thr Asp Lys Asp Thr Glu Ile Thr Cys
            100                 105                 110

Ser Glu Arg Val Arg Thr Tyr Trp Ile Ile Ile Glu Leu Lys His Lys
        115                 120                 125

Ala Arg Glu Lys Pro Tyr Asp Ser Lys Ser Leu Arg Thr Ala Leu Gln
    130                 135                 140

Lys Glu Ile Thr Thr Arg Tyr Gln Leu Asp Pro Lys Phe Ile Thr Ser
145                 150                 155                 160

Ile Leu Tyr Glu Asn Asn Val Ile Thr Ile Asp Leu Val Gln Asn Ser
                165                 170                 175

Ser Gln Lys Thr Gln Asn Asp Val Asp Ile Ala Asp Val Ala Tyr Tyr
            180                 185                 190

Phe Glu Lys Asp Val Lys Gly Glu Ser Leu Phe His Ser Lys Lys Met
        195                 200                 205

Asp Leu Thr Val Asn Gly Glu Gln Leu Asp Leu Asp Pro Gly Gln Thr
    210                 215                 220

Leu Ile Tyr Tyr Val Asp Glu Lys Ala Pro Glu Phe Ser Met Gln Gly
225                 230                 235                 240

Leu Lys



<210> 40 
<211> 265 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Fragment of hEpCAM comprising hEpCAM signal 
peptide+ propeptide+ECD (termed "sEpCAM") 

<400> 40 

Met Ala Pro Pro Gln Val Leu Ala Phe Gly Leu Leu Leu Ala Ala Ala
1               5                   10                  15

Thr Ala Thr Phe Ala Ala Ala Gln Glu Glu Cys Val Cys Glu Asn Tyr
            20                  25                  30

Lys Leu Ala Val Asn Cys Phe Val Asn Asn Asn Arg Gln Cys Gln Cys
        35                  40                  45

Thr Ser Val Gly Ala Gln Asn Thr Val Ile Cys Ser Lys Leu Ala Ala
    50                  55                  60

Lys Cys Leu Val Met Lys Ala Glu Met Asn Gly Ser Lys Leu Gly Arg
65                  70                  75                  80

Arg Ala Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
                85                  90                  95

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
            100                 105                 110

Thr Ser Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
        115                 120                 125

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
    130                 135                 140

Ile Ile Glu Leu Lys His Lys Ala Arg Glu Lys Pro Tyr Asp Ser Lys
145                 150                 155                 160

Ser Leu Arg Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
                165                 170                 175

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
            180                 185                 190

Ile Asp Leu Val Gln Asn Ser Ser Gln Lys Thr Gln Asn Asp Val Asp
        195                 200                 205

Ile Ala Asp Val Ala Tyr Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
    210                 215                 220

Leu Phe His Ser Lys Lys Met Asp Leu Thr Val Asn Gly Glu Gln Leu
225                 230                 235                 240

Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Glu Lys Ala
                245                 250                 255

Pro Glu Phe Ser Met Gln Gly Leu Lys
            260                 265





<210> 41 
<211> 314 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> WT full-length/membrane-bound hEpCAM (comprising
SP+PP+ECD+TMD+CD domains)

<220>

<221> DOMAIN

<222> (1)...(23)

<223> signal peptide (SP)

<220>

<221> DOMAIN

<222> (81)...(265)

<223> extracellular domain (ECD)

<220>

<221> DOMAIN

<222> (266)...(288)

<223> transmembrane domain (TMD)

<220>

<221> DOMAIN

<222> (289)...(314)

<223> cytoplasmic domain (CD)

<400> 41

Met Ala Pro Pro Gln Val Leu Ala Phe Gly Leu Leu Leu Ala Ala Ala
1               5                   10                  15

Thr Ala Thr Phe Ala Ala Ala Gln Glu Glu Cys Val Cys Glu Asn Tyr
            20                  25                  30

Lys Leu Ala Val Asn Cys Phe Val Asn Asn Asn Arg Gln Cys Gln Cys
        35                  40                  45

Thr Ser Val Gly Ala Gln Asn Thr Val Ile Cys Ser Lys Leu Ala Ala
    50                  55                  60

Lys Cys Leu Val Met Lys Ala Glu Met Asn Gly Ser Lys Leu Gly Arg
65                  70                  75                  80

Arg Ala Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
                85                  90                  95

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
            100                 105                 110

Thr Ser Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
        115                 120                 125

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
    130                 135                 140

Ile Ile Glu Leu Lys His Lys Ala Arg Glu Lys Pro Tyr Asp Ser Lys
145                 150                 155                 160

Ser Leu Arg Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
                165                 170                 175

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
            180                 185                 190

Ile Asp Leu Val Gln Asn Ser Ser Gln Lys Thr Gln Asn Asp Val Asp
        195                 200                 205

Ile Ala Asp Val Ala Tyr Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
    210                 215                 220

Leu Phe His Ser Lys Lys Met Asp Leu Thr Val Asn Gly Glu Gln Leu
225                 230                 235                 240

Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Glu Lys Ala
                245                 250                 255

Pro Glu Phe Ser Met Gln Gly Leu Lys Ala Gly Val Ile Ala Val Ile
            260                 265                 270

Val Val Val Val Met Ala Val Val Ala Gly Ile Val Val Leu Val Ile
        275                 280                 285

Ser Arg Lys Lys Arg Met Ala Lys Tyr Glu Lys Ala Glu Ile Lys Glu
    290                 295                 300

Met Gly Glu Met His Arg Glu Leu Asn Ala
305                 310



<210> 42 
<211> 942 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> DNA encoding WT full-length/membrane-bound hEpCAM
(comprising signal peptide+propeptide+ECD+TMD+CD
domains)

<220>

<221> misc_feature

<222> (1)...(63)

<223> coding sequence for SP

<220>

<221> misc_feature

<222> (244)...(795)

<223> coding sequence for ECD

<220>

<221> misc_feature

<222> (865)...(942)

<223> coding sequence for CD 



<400> 42 

atggcgcccc cgcaggtcct cgcgttcggg cttctgcttg ccgcggcgac ggcgactttt 60
gccgcagctc aggaagaatg tgtctgtgaa aactacaagc tggccgtaaa ctgctttgtg 120
aataataatc gtcaatgcca gtgtacttca gttggtgcac aaaatactgt catttgctca 180 
aagctggctg ccaaatgttt ggtgatgaag gcagaaatga atggctcaaa acttgggaga 240 
agagcaaaac ctgaaggggc cctccagaac aatgatgggc tttatgatcc tgactgcgat 300
gagagcgggc tctttaaggc caagcagtgc aacggcacct ccacgtgctg gtgtgtgaac 360 
actgctgggg tcagaagaac agacaaggac actgaaataa cctgctctga gcgagtgaga 420 
acctactgga tcatcattga actaaaacac aaagcaagag aaaaacctta tgatagtaaa 480 
agtttgcgga ctgcacttca gaaggagatc acaacgcgtt atcaactgga tccaaaattt 540 
atcacgagta ttttgtatga gaataatgtt atcactattg atctggttca aaattcttct 600 
caaaaaactc agaatgatgt ggacatagct gatgtggctt attattttga aaaagatgtt 660 
aaaggtgaat ccttgtttca ttctaagaaa atggacctga cagtaaatgg ggaacaactg 720 
gatctggatc ctggtcaaac tttaatttat tatgttgatg aaaaagcacc tgaattctca 780 
atgcagggtc taaaagctgg tgttattgct gttattgtgg ttgtggtgat ggcagttgtt 840 
gctggaattg ttgtgctggt tatttccaga aagaagagaa tggcaaagta tgagaaggct 900 
gagataaagg agatgggtga gatgcatagg gaactcaatg ca 942 

<210> 43 
<211> 234 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Mature domain of hEpCAM (comprising ECD+TMD+CD) 
<220> 

<221> DOMAIN 

<222> (186)...(208)

<223> transmembrane domain (TMD) 



<400> 43 
Arg Ala Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
1               5                   10                  15

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
            20                  25                  30

Thr Ser Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
        35                  40                  45

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
    50                  55                  60

Ile Ile Glu Leu Lys His Lys Ala Arg Glu Lys Pro Tyr Asp Ser Lys
65                  70                  75                  80

Ser Leu Arg Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
                85                  90                  95

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
            100                 105                 110

Ile Asp Leu Val Gln Asn Ser Ser Gln Lys Thr Gln Asn Asp Val Asp
        115                 120                 125

Ile Ala Asp Val Ala Tyr Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
    130                 135                 140

Leu Phe His Ser Lys Lys Met Asp Leu Thr Val Asn Gly Glu Gln Leu
145                 150                 155                 160

Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Glu Lys Ala
                165                 170                 175

Pro Glu Phe Ser Met Gln Gly Leu Lys Ala Gly Val Ile Ala Val Ile
            180                 185                 190

Val Val Val Val Met Ala Val Val Ala Gly Ile Val Val Leu Val Ile
        195                 200                 205

Ser Arg Lys Lys Arg Met Ala Lys Tyr Glu Lys Ala Glu Ile Lys Glu
    210                 215                 220

Met Gly Glu Met His Arg Glu Leu Asn Ala
225                 230



<210> 44 
<211> 699 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> DNA encoding mature domain of hEpCAM (comprising 
ECD+TMD+CD) 

<220> 

<221> misc_feature 

<222> (553)...(621)

<223> coding sequence for TMD 

<400> 44 

gcaaaacctg aaggggccct ccagaacaat gatgggcttt atgatcctga ctgcgatgag 60 
agcgggctct ttaaggccaa gcagtgcaac ggcacctcca cgtgctggtg tgtgaacact 120
gctggggtca gaagaacaga caaggacact gaaataacct gctctgagcg agtgagaacc 180 
tactggatca tcattgaact aaaacacaaa gcaagagaaa aaccttatga tagtaaaagt 240 
ttgcggactg cacttcagaa ggagatcaca acgcgttatc aactggatcc aaaatttatc 300 
acgagtattt tgtatgagaa taatgttatc actattgatc tggttcaaaa ttcttctcaa 360 
aaaactcaga atgatgtgga catagctgat gtggcttatt attttgaaaa agatgttaaa 420 
ggtgaatcct tgtttcattc taagaaaatg gacctgacag taaatgggga acaactggat 480 
ctggatcctg gtcaaacttt aatttattat gttgatgaaa aagcacctga attctcaatg 540 
cagggtctaa aagctggtgt tattgctgtt attgtggttg tggtgatggc agttgttgct 600 
ggaattgttg tgctggttat ttccagaaag aagagaatgg caaagtatga gaaggctgag 660
ataaaggaga tgggtgagat gcatagggaa ctcaatgca 699 

<210> 45 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Fragment of hEpCAM comprising TMD 
<400> 45 

Ala Gly Val Ile Ala Val Ile Val Val Val Val Met Ala Val Val Ala
1               5                   10                  15

Gly Ile Val Val Leu Val Ile
            20



<210> 46 
<211> 26 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Fragment of hEpCAM CD 
<400> 46 

Ser Arg Lys Lys Arg Met Ala Lys Tyr Glu Lys Ala Glu Ile Lys Glu
1               5                   10                  15

Met Gly Glu Met His Arg Glu Leu Asn Ala
            20                  25



<210> 47 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope of hEpCAM in ECD 
<400> 47 

Gly Leu Tyr Asp Pro Asp Cys Asp Glu 
1 5 



<210> 48
<211> 9
<212> PRT

<213> Artificial Sequence
<220>

<223> Epitope of hEpCAM in ECD
<400> 48

Ile Leu Tyr Glu Asn Asn Val Ile Thr
1               5



<210> 49 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope of hEpCAM in ECD 
<400> 49 

Tyr Gln Leu Asp Pro Lys Phe Ile Thr
1               5

<210> 50
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope of hEpCAM in ECD 
<400> 50 

Tyr Gln Leu Asp Pro Lys Phe Ile Thr Ser Ile
1               5                   10



<210> 51 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope of hEpCAM in ECD 
<400> 51 

Tyr Gln Leu Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn
1               5                   10                  15

Val Ile Thr



<210> 52 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope of hEpCAM in ECD 
<400> 52 

Tyr Gln Leu Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn
1               5                   10                  15

Val Ile Thr Ile
            20



<210> 53 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope - in ECD 
<400> 53 

Tyr Gln Leu Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn
1               5                   10                  15

Val Ile Thr Ser Ile
            20

<210> 54
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope - in ECD 
<400> 54 

Leu Asp Leu Asp Pro Gly Gln Thr Leu
1               5



<210> 55 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope - in ECD 
<400> 55 

Leu Leu Asp Leu Asp Pro Gly Gln Thr Leu
1               5                   10



<210> 56 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope - in ECD 
<400> 56 

Gln Leu Asp Leu Asp Pro Gly Gln Thr Leu
1               5                   10



<210> 57 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope - in ECD 
<400> 57 

Trp Ile Ile Ile Glu Leu Lys His Lys Ala
1               5                   10



<210> 58 
<211> 10 
<212> PRT 
<213> Artificial Sequence



<220> 

<223> Epitope - in ECD 
<400> 58 

Trp Ile Ile Ile Glu Leu Lys His Lys Glu
1               5                   10



<210> 59 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope - in ECD 
<400> 59 

Ser Thr Cys Trp Cys Val Asn Thr Ala 
1 5 



<210> 60 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope - in ECD 
<400> 60 

Ala Thr Cys Trp Cys Val Asn Thr Ala 
1 5 



<210> 61 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope - in ECD 
<400> 61 

Tyr Val Asp Glu Lys Ala Pro Glu Phe Ser Met
1               5                   10



<210> 62 

<211> 11 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope - in ECD 
<400> 62

Tyr Val Asp Glu Lys Ala Pro Glu Phe Ser Asn
1               5                   10



<210> 63 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope - in ECD 
<400> 63 

Gln Asn Asn Asp Gly Leu Tyr Asp Pro Asp Cys Asp Glu Ser Gly Leu
1               5                   10                  15

Phe Asp



<210> 64 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope - in ECD 
<400> 64 

Gln Asn Asn Asp Gly Leu Tyr Asp Pro Asp Cys Asp Glu Ser Gly Leu
1               5                   10                  15

Phe Lys



<210> 65 

<211> 9 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope in ECD/TMD 

<400> 65 

Ser Met Gln Gly Leu Lys Ala Gly Val
1               5



<210> 66 

<211> 11 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope in ECD/TMD 

<400> 66 

Ser Met Gln Gly Leu Lys Ala Val Ala Gly Val
1               5                   10



<210> 67 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope in ECD/TMD 
<400> 67 

Ser Met Gln Gly Leu Lys Ala Val Ala Gly Val Thr Ala Val
1               5                   10



<210> 68 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope in ECD/TMD 
<400> 68 

Gly Leu Lys Ala Gly Val Ile Ala Val Ile Val
1               5                   10



<210> 69 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope in ECD/TMD 
<400> 69 

Gly Leu Lys Ala Gly Val Ile Ala Val
1               5



<210> 70 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope in ECD/TMD 
<400> 70 

Gly Leu Lys Ala Gly Val Ile Ala Val Ile
1               5                   10



<210> 71 
<211> 10
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Propeptide epitope 
<400> 71 

Cys Val Cys Glu Asn Tyr Lys Leu Ala Val
1               5                   10



<210> 72 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Propeptide epitope
<400> 72 

Ala Gln Asn Thr Val Ile Cys Ser Lys Leu Ala Ala Lys Cys Leu Val
1               5                   10                  15

Met Lys



<210> 73 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Propeptide epitope 
<400> 73 

Leu Leu Leu Ala Ala Ala Thr Ala Thr Phe Ala 
1 5 10 



<210> 74 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope - in signal peptide 
<400> 74 

Gln Val Leu Ala Phe Gly Leu Leu Leu
1               5



<210> 75 

<211> 9 

<212> PRT 

<213> Artificial Sequence 
<220>

<223> Epitope - in signal peptide 
<400> 75 

Leu Leu Ala Ala Thr Ala Thr Phe Ala 
1 5 



<210> 76 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-25 signal peptide epitope 
<400> 76 

Gln Ala Leu Ala Leu Gly Leu Leu Leu
1               5



<210> 77 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope in TMD 
<400> 77 

Val Val Ala Gly Ile Val Val Leu Val
1               5



<210> 78 

<211> 266 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-25/18 chimera (comprising SP+PP+ECD)



<400> 78 



Met Ala Pro Pro Gln Ala Leu Ala Leu Gly Leu Leu Leu Ala Ala Ala
1               5                   10                  15

Thr Ala Thr Phe Ala Ala Ala Gln Glu Glu Cys Val Cys Glu Asn Tyr
            20                  25                  30

Lys Leu Ala Val Asn Cys Phe Val Asn Asn Asn Arg Glu Cys Gln Cys
        35                  40                  45

Thr Ser Val Gly Ala Gln Asn Thr Val Ile Cys Ser Lys Leu Ala Ala
    50                  55                  60

Lys Cys Leu Val Met Lys Ala Glu Met Asn Gly Ser Lys Leu Gly Arg
65                  70                  75                  80

Arg Ile Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp
                85                  90                  95

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly
            100                 105                 110

Thr Ala Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp
        115                 120                 125

Lys Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile
    130                 135                 140

Ile Ile Glu Leu Lys His Lys Glu Arg Glu Ser Pro Tyr Asp Ser Lys
145                 150                 155                 160

Ser Leu His Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu
                165                 170                 175

Asp Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr
            180                 185                 190

Ile Asp Leu Met Gln Asn Ser Ser Gln Lys Thr Gln Asn Asp Val Asp
        195                 200                 205

Ile Ala Asp Val Ala His Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser
    210                 215                 220

Leu Phe His Ser Ser Lys Lys Met Asp Leu Arg Val Asn Gly Glu Gln
225                 230                 235                 240

Leu Asp Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Arg Asn
                245                 250                 255

Ala Pro Glu Phe Ser Met Gln Ala Leu Lys
            260                 265





<210> 79 

<211> 798 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> DNA encoding Tag-25/18 chimera comprising
SP+PP+ECD 



<400> 79 

atggcacccc ctcaagcact ggcactgggt ctgctgctgg ccgccgctac cgccactttc 60
gccgcagcac aggaggagtg tgtgtgcgaa aactacaagc tcgctgtcaa ctgtttcgtc 120
aacaataata gagaatgcca gtgcacttct gtgggagcac agaatacagt gatctgtagc 180
aaactggctg caaagtgtct ggtcatgaag gccgaaatga acggatccaa gctcgggcgg 240
aggatcaaac ctgaaggagc tctgcagaac aacgatggtc tctacgaccc cgactgtgac 300
gagtccggcc tcttcaaggc caaacagtgt aatggcactg ctacatgctg gtgcgtgaac 360
accgctgggg tgcgccggac cgataaggat accgaaatta cctgttctga gagggtccgg 420
acatattgga tcatcattga actcaaacat aaagagcgcg agtctccata cgattctaaa 480
tccctccata ctgcactgca aaaggaaatc actacacgct accagctgga tccaaaattc 540
attacatcca tcctctatga gaacaatgtt attacaattg atctgatgca aaatagctct 600
cagaagactc agaatgatgt ggacatagct gatgtggctc attattttga aaaagatgtt 660
aaaggtgaat ccttgtttca ttcttctaag aaaatggacc tgagagtaaa tggagaacaa 720
ctggatctgg atcctggtca aactttaatt tattatgttg atagaaatgc acctgaattt 780
tcaatgcagg ctctaaaa 798



<210> 80 

<211> 25 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-18 TMD 



<400> 80 



Ala Gly Val Cys Ala Val Ala Val Ile Val Val Val Met Ile Ala Val

1 5 10 15

Val Ala Gly Ile Val Val Leu Val Ile
20 25 



<210> 81 
<211> 53 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Seq Pattern 1 (propeptide alignment) 
<220> 

<221> VARIANT 
<222> (1)...(53)

<223> Xaa = any amino acid or as shown in Table 3 
<400> 81 

Gln Xaa Xaa Cys Val Cys Xaa Asn Tyr Lys Leu Xaa Xaa Xaa Cys Xaa

1 5 10 15

Xaa Asn Xaa Xaa Xaa Xaa Cys Gln Cys Thr Ser Xaa Gly Xaa Gln Asn

20 25 30

Thr Val Ile Cys Ser Lys Leu Ala Xaa Met Lys Ala Glu Met Xaa Xaa

35 40 45 

Ser Lys Xaa Gly Arg 
50 



<210> 82 
<211> 53 
<212> PRT 

<213> Artificial Sequence 

<220>
<223> Seq pattern 2 - propeptide alignment with 
retention of epitopes 

<220> 

<221> VARIANT 
<222> (1)...(53)

<223> Xaa = any amino acid or as shown in Table 4 
<400> 82 

Gln Xaa Xaa Cys Val Cys Glu Asn Tyr Lys Leu Ala Val Xaa Cys Xaa

1 5 10 15

Xaa Asn Xaa Xaa Xaa Xaa Cys Gln Cys Thr Ser Xaa Gly Xaa Gln Asn

20 25 30

Thr Val Ile Cys Ser Lys Leu Ala Val Met Lys Ala Glu Met Xaa Xaa

35 40 45 

Ser Lys Xaa Gly Arg 
50 



<210> 83 
<211> 23 
<212> PRT

<213> Artificial Sequence 
<220> 

<223> Signal peptide sequence pattern 
<220> 

<221> VARIANT 
<222> (1)...(23)

<223> Xaa = any amino acid or as shown in Table 5 
<400> 83 

Met Ala Xaa Pro Xaa Xaa Leu Ala Xaa Gly Leu Leu Leu Ala Xaa Xaa 

1 5 10 15

Thr Ala Thr Xaa Ala Ala Ala 
20 



<210> 84 
<211> 33 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> EGF-LIKE DOMAIN 1 SEQ PATTERN 

<220> 

<221> VARIANT 

<222> (1)...(33)

<223> Xaa = any amino acid 

<400> 84 

Cys Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa 

1 5 10 15 

Xaa Xaa Xaa Cys Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Cys 



<210> 85 
<211> 70 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> EGF-LIKE DOMAIN 2 SEQ PATTERN 
<220> 

<221> VARIANT 

<222> (1)...(70)

<223> Xaa = any amino acid 

<400> 85 

Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

1 5 10 15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
20 25 30

Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 

35 40 45 

Xaa Xaa Cys Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

50 55 60 

Xaa Xaa Xaa Xaa Xaa Cys 
65 70 



<210> 86 
<211> 33 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> EGF-LIKE DOMAIN 1 SEQ PATTERN with TAg-25 epitope 
<220> 

<221> VARIANT 

<222> (1)...(33)

<223> Xaa = any amino acid 

<400> 86 

Cys Val Cys Glu Asn Tyr Lys Leu Ala Val Xaa Cys Xaa Xaa Xaa Xaa 

1 5 10 15 

Xaa Xaa Xaa Cys Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Cys 



<210> 87 
<211> 70 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> EGF-LIKE DOMAIN 2 SEQ PATTERN with TAg-25 epitopes 
and RR site 

<220> 

<221> VARIANT 

<222> (1)...(70)

<223> Xaa = any amino acid 

<220> 

<221> VARIANT 
<222> (17)...(17)

<223> Xaa = preferably Ala, Ile or Met
<400> 87 

Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Arg 

1 5 10 15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gln Asn Asn Asp Gly Leu Tyr Asp Pro

20 25 30

Asp Cys Asp Glu Ser Gly Leu Phe Lys Xaa Xaa Xaa Cys Xaa Xaa Xaa
35 40 45
Ala Thr Cys Trp Cys Val Asn Thr Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa

50 55 60 

Xaa Xaa Xaa Xaa Xaa Cys 

65 70 



<210> 88 
<211> 109 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Combined EGF domains (all Cys) pattern 
<220> 

<221> VARIANT 

<222> (1)...(109)

<223> Xaa = any amino acid

<400> 88 

Cys Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa 

1 5 10 15

Xaa Xaa Xaa Cys Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 

Cys Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

50 55 60 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 

Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Cys Xaa Cys Xaa Xaa Xaa Xaa 

85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 
100 105 



<210> 89 
<211> 109 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Example of 12 Cys pattern with some epitopes



<220> 

<221> VARIANT 

<222> (1)...(109)

<223> Xaa = any amino acid 



<220> 

<221> VARIANT 
<222> (56)...(56)

<223> Xaa = preferably Ala, Ile or Met
<400> 89 

Cys Val Cys Glu Asn Tyr Lys Leu Ala Val Xaa Cys Xaa Xaa Xaa Xaa
1 5 10 15

Xaa Xaa Xaa Cys Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
20 25 30

Cys Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
35 40 45

Xaa Xaa Xaa Xaa Xaa Arg Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gln Asn
50 55 60

Asn Asp Gly Leu Tyr Asp Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys
65 70 75 80

Xaa Xaa Xaa Cys Xaa Xaa Xaa Ala Thr Cys Trp Cys Val Asn Thr Ala
85 90 95

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys
100 105



<210> 90 
<211> 55 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Mature portion of SEQ ID NO: 89 
<220> 

<221> VARIANT 

<222> (1)...(55)

<223> Xaa = any amino acid 

<220> 

<221> VARIANT 
<222> (2)...(2)

<223> Xaa = preferably Ala, Ile or Met



<400> 90 

Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gln Asn Asn Asp Gly Leu Tyr Asp

1 5 10 15

Pro Asp Cys Asp Glu Ser Gly Leu Phe Lys Xaa Xaa Xaa Cys Xaa Xaa 

20 25 30 

Xaa Ala Thr Cys Trp Cys Val Asn Thr Ala Xaa Xaa Xaa Xaa Xaa Xaa 

35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Cys 
50 55 



<210> 91 
<211> 71 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Sequence pattern - Thyroglobulin type-1 repeat 
motif 



<220> 

<221> VARIANT 

<222> (1)...(71)

<223> Xaa = any amino acid 
<400> 91

Cys Xaa Val Glu Arg Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa 

1 5 10 15 

Xaa Xaa Xaa Xaa Glu Gly Ala Leu Xaa Xaa Xaa Xaa Gly Leu Tyr Xaa 

20 25 30 

Pro Xaa Cys Asp Glu Xaa Gly Xaa Xaa Lys Xaa Xaa Gln Cys Xaa Xaa

35 40 45 

Xaa Xaa Xaa Xaa Cys Trp Cys Val Asp Xaa Xaa Gly Xaa Xaa Xaa Xaa 

50 55 60 

Xaa Xaa Asp Xaa Xaa Xaa Glu 
65 70 



<210> 92 

<211> 184 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> TAg-25 fragment comprising ECD N-terminal variant 



<400> 92 



Ile Lys Pro Glu Gly Ala Leu Gln Asn Asn Asp Gly Leu Tyr Asp Pro
1 5 10 15

Asp Cys Asp Glu Ser Gly Leu Phe Lys Ala Lys Gln Cys Asn Gly Thr
20 25 30

Ala Thr Cys Trp Cys Val Asn Thr Ala Gly Val Arg Arg Thr Asp Lys
35 40 45

Asp Thr Glu Ile Thr Cys Ser Glu Arg Val Arg Thr Tyr Trp Ile Ile
50 55 60

Ile Glu Leu Lys His Lys Glu Arg Glu Ser Pro Tyr Asp Ser Lys Ser
65 70 75 80

Leu His Thr Ala Leu Gln Lys Glu Ile Thr Thr Arg Tyr Gln Leu Asp
85 90 95

Pro Lys Phe Ile Thr Ser Ile Leu Tyr Glu Asn Asn Val Ile Thr Ile
100 105 110

Asp Leu Met Gln Asn Ser Ser Gln Lys Thr Gln Asp Asp Val Asp Ile
115 120 125

Ala Asp Val Ala Tyr Tyr Phe Glu Lys Asp Val Lys Gly Glu Ser Leu
130 135 140

Phe His Ser Lys Lys Met Asp Leu Arg Val Asn Gly Glu Leu Leu Asp
145 150 155 160

Leu Asp Pro Gly Gln Thr Leu Ile Tyr Tyr Val Asp Glu Lys Ala Pro
165 170 175

Glu Phe Ser Met Gln Gly Leu Lys
180



<210> 93 

<211> 795 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA encoding hEpCAM antigenic fragment comprising 
SP+PP+ECD (termed "sEpCAM")
<400> 93

atggcgcccc cgcaggtcct cgcgttcggg cttctgcttg ccgcggcgac ggcgactttt 60 
gccgcagctc aggaagaatg tgtctgtgaa aactacaagc tggccgtaaa ctgctttgtg 120 
aataataatc gtcaatgcca gtgtacttca gttggtgcac aaaatactgt catttgctca 180 
aagctggctg ccaaatgttt ggtgatgaag gcagaaatga atggctcaaa acttgggaga 240 
agagcaaaac ctgaaggggc cctccagaac aatgatgggc tttatgatcc tgactgcgat 300 
gagagcgggc tctttaaggc caagcagtgc aacggcacct ccacgtgctg gtgtgtgaac 360 
actgctgggg tcagaagaac agacaaggac actgaaataa cctgctctga gcgagtgaga 420 
acctactgga tcatcattga actaaaacac aaagcaagag aaaaacctta tgatagtaaa 480 
agtttgcgga ctgcacttca gaaggagatc acaacgcgtt atcaactgga tccaaaattt 540 
atcacgagta ttttgtatga gaataatgtt atcactattg atctggttca aaattcttct 600
caaaaaactc agaatgatgt ggacatagct gatgtggctt attattttga aaaagatgtt 660 
aaaggtgaat ccttgtttca ttctaagaaa atggacctga cagtaaatgg ggaacaactg 720 
gatctggatc ctggtcaaac tttaatttat tatgttgatg aaaaagcacc tgaattctca 780 
atgcagggtc taaaa 795 

<210> 94 
<211> 372 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> nucleotide sequence of a mAb variable heavy chain 
domain 

<400> 94 

gaggtgaagc tgctggagtc cggaggtggc ctggtgcagc ctggaggatc cctgaaactc 60 
tcctgtgcag cctcaggatt cgattttagt agatactgga tgagttgggt ccggcaggct 120
ccagggaaag ggctagaatg gattggagat attaatctag aaagcaatac gataaactat 180 
acgccatctc taaaggataa attcatcatc tccagagaca acgccaaaaa tacgctgtac 240 
ctgcaaatga acaaagtgag atctgaggac acagcccttt attactgtgc aagaggggcc 300 
tatactatgg actactgggg tcaaggaacc tcagtcaccg tctcctcagc caaaacgaca 360 
cccccatctg tc 372 

<210> 95 
<211> 126 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> amino acid sequence of a mAb variable heavy chain 
domain 

<400> 95 

Glu Val Lys Leu Leu Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly

1 5 10 15 

Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Asp Phe Ser Arg Tyr 

20 25 30 

Trp Met Ser Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile

35 40 45

Gly Asp Ile Asn Leu Glu Ser Asn Thr Ile Asn Tyr Thr Pro Ser Leu

50 55 60 

Lys Asp Lys Phe Ile Ile Ser Arg Asp Asn Ala Lys Asn Thr Leu Tyr
65 70 75 80

Leu Gln Met Asn Lys Val Arg Ser Glu Asp Thr Ala Leu Tyr Tyr Cys

85 90 95

Ala Arg Gly Ala Tyr Thr Met Asp Tyr Trp Gly Gln Gly Thr Ser Val
100 105 110

Thr Val Ser Ser Ala Lys Thr Thr Pro Pro Ser Val Ala Ala 
115 120 125 